Beating Henry Higgins at His Own Game: A Markovian Approach to Dialectology

نویسندگان

  • Richard Chung
  • Lav Varshney
چکیده

1. Introduction The performance of speech recognition algorithms degrades considerably due to speaker variability. Aside from gender, the largest cause for speaker variability is accent. If the accent of a speaker can be determined automatically, then accent-specific speech recognition models can be used, thereby increasing speech recognition accuracy. In this study, the problem of accent classification from a database of Central New York (CNY), North India (IND), and Singapore (SIN)-native English speakers is considered. Linguists have studied regional accents extensively and have described many features particular to the three regions we are considering [1]. The problem of automatic accent classification has received attention within the last decade. Vonwiller, Blackburn, and King used artificial neural networks (ANN) for automatic accent classification [2]. Hansen and Arslan found that energy, duration, and spectral information are good features for accent detection, and that the most distinct features of accent are at the phonemic level. They also formulated a Hidden Markov Model (HMM) classification algorithm [3]. Arslan and Hansen further investigated the HMM for accent classification, using three different scenarios: isolated word – full search, continuous speech – full search, and continuous speech – partial search. In all scenarios, a left-to-right HMM topology with no skip states was used. In tests using speech samples from native speakers of American English, Turkish, Chinese, and German, the HMM algorithm was found to classify better than human classifiers [4]. Teixeira, Trancoso, and Serralheiro applied a parallel set or ergodic nets with context independent HMM units to the problem of English accent identification of speakers from 6 different European countries [5]-[6]. Kat and Fung used phoneme-class based HMMs, rather than phoneme-based HMMs for classification [7]. Chen et al. use a Gaussian mixture model (GMM) to classify different accents in Chinese [8]. We have decided to use two different learning systems to perform the task of accent classification. The first is a HMM with a left-to-right phoneme-based topology. The second is a k-nearest neighbor (KNN) algorithm. We will compare the performance of these two learning systems with the performance of human classifiers.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Panel Discussion on Computing and the Humanities

This is the report of a panel discussion held in connection with the special session on computational methods in dialectology at Methods XIII: Methods in Dialectology on 5 August, 2008 at the University of Leeds. We scheduled this panel discussion in order to reflect on what the introduction of computational methods has meant to our subfield of linguistics, dialectology (in alternative division...

متن کامل

Two-tier Supplier Base Efficiency Evaluation Via Network DEA: A Game Theory Approach

In today's competitive markets, firms try to reduce their supply cost by selecting efficient suppliers using different techniques. Several methods can be applied to evaluate the efficiency of supplier base. This paper develops generalized network data envelopment analysis models to examine the efficiency of two-tier supplier bases under cooperative and non-cooperative strategies where each tier...

متن کامل

Details of Vision of 132 Cases of Intracapsular Extraction of Cataract

Mr. Eason in the July number of the Lancet, 1911, writes a paper on Cataract Extraction, the basis of which is not his own personal experience but a paper published in the Royal London Ophthalmic Hospital Reports, Volume 16, Part 3, October 1905, by Mr. E. Treacher Collins and a paper by Mr. Charles Higgins {Lancet, 13th April 1907), and a paper published by Major Kilkelly in the Indian Medical...

متن کامل

MODELING RISK OF LOSING A CUSTOMER IN A TWO-ECHELON SUPPLY CHAIN FACING AN INTEGRATED COMPETITOR: A GAME THEORY APPROACH

In a competitive market, customer decision is made to maximize his utility. It can be assumed that risk of losing a supply chain’s customer can be defined based on products utility from customer point of view. This paper takes account of product price and service level as competition criteria. The proposed model is based on non-cooperative game theory, for one-manufacturer and one-retailer supp...

متن کامل

Identifying Characteristics of a National Socialist: Germany and Alfred Wittmann

Acknowledgements At this moment, I would like to thank my Honors Thesis Coordinator Dr. Stephen Fritz for his continuous support with my Honors Thesis. Throughout my undergraduate career, lectures given by Dr. Fritz have inspired me to engage in more research and to ask more questions. I would also like express gratitude to my Thesis Readers Dr. O'Brien and Dr. Achilov for taking their time to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003